Linguistic Processing in Lattice-Based Taxonomy Construction
Abstract
Building a lattice-based taxonomy over a text corpus with formal concept analysis (FCA) requires preliminary text processing that makes it possible to construct a formal context. We consider several natural language processing methods for automatic attribute acquisition from texts. In particular, we derive attributes of three types: frequent words, latent topics, and named entities. We then construct a context for each attribute type, taking the documents in the corpus as the set of objects. The corresponding concept lattices are built and pruned using the stability index to improve the readability of the diagrams. The proposed technique is illustrated on a collection of 26 English-language texts from the political domain. In this case, the technique serves as a tool for better understanding the interests of the political actors who produce these texts, by clarifying the connections between the notions they use.
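The pipeline described in the abstract (documents as objects, extracted terms as attributes, concepts pruned by stability) can be sketched on a toy scale. The following is only an illustration under stated assumptions, not the authors' implementation: the function names, the whitespace tokenizer, the `min_df` threshold, and the three sample documents are all hypothetical, the concept enumeration is a naive exponential one, and the stability variant shown is the common intensional definition, which may differ from the one used in the paper.

```python
from collections import Counter
from itertools import combinations

def build_context(docs, min_df=2):
    """Formal context: document id -> set of frequent words it contains.
    A word counts as 'frequent' if it occurs in at least min_df documents
    (an assumed criterion, chosen only for this sketch)."""
    tokens = {d: set(text.lower().split()) for d, text in docs.items()}
    df = Counter(w for words in tokens.values() for w in words)
    frequent = {w for w, n in df.items() if n >= min_df}
    return {d: words & frequent for d, words in tokens.items()}

def intent(context, objs):
    """Attributes shared by all given objects (all attributes if objs is empty)."""
    sets = [context[d] for d in objs]
    if not sets:
        return frozenset(set().union(*context.values()))
    return frozenset(set.intersection(*sets))

def extent(context, attrs):
    """Objects that have every attribute in attrs."""
    return frozenset(d for d, a in context.items() if attrs <= a)

def concepts(context):
    """All formal concepts (extent, intent), found by closing every subset
    of objects. Exponential in the number of objects: toy corpora only."""
    objs = list(context)
    found = set()
    for r in range(len(objs) + 1):
        for subset in combinations(objs, r):
            i = intent(context, subset)
            found.add((extent(context, i), i))
    return found

def stability(context, ext, int_):
    """Intensional stability: share of subsets of the extent whose
    common attributes are exactly the concept's intent."""
    members = list(ext)
    hits = sum(
        1
        for r in range(len(members) + 1)
        for subset in combinations(members, r)
        if intent(context, subset) == int_
    )
    return hits / 2 ** len(members)

# Hypothetical three-document corpus, for illustration only.
docs = {
    "d1": "tax reform budget",
    "d2": "tax budget election",
    "d3": "election campaign tax",
}
context = build_context(docs)
for ext, int_ in sorted(concepts(context), key=lambda c: len(c[0])):
    print(sorted(ext), sorted(int_), stability(context, ext, int_))
```

Pruning then amounts to keeping only the concepts whose stability exceeds a chosen threshold. For a real corpus, a dedicated FCA algorithm would replace the subset enumeration, and the latent-topic and named-entity contexts would be built the same way with a different attribute extractor.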
Similar resources
An Irregular Lattice Pore Network Model Construction Algorithm
Pore network modeling uses a network of pores connected by throats to model the void space of a porous medium and tries to predict its various characteristics during multiphase flow of various fluids. In most cases, a non-realistic regular lattice of pores is used to model the characteristics of a porous medium. Although some methodologies for extracting geologically realistic irregular net...
School-tagging: interactive language exercises in classrooms
We present a prototype of a novel online platform for promoting playful learning exercises in classrooms, allowing teachers to engage with students interactively. Unlike typical e-learning environments, it is the teacher, not the machine, who leads the learning activity, i.e., she is able to monitor students’ individual and aggregated answers and provide them real-time feedbac...
A Linguistic Service Ontology for Language Infrastructures
This paper introduces a conceptual framework of an ontology for describing linguistic services on network-based language infrastructures. The ontology defines a taxonomy of processing resources and the associated static language resources. It also develops a sub-ontology for abstract linguistic objects such as expression, meaning, and description; these help define functionalities of a linguistic...
Identifying Assertions in Text and Discourse: The Presentational Relative Clause Construction
In this paper we investigate the Presentational Relative Clause (PRC) construction. In both the linguistic and NLP literature, relative clauses have been considered to contain background information that is not directly relevant or highly useful in semantic analysis. In text summarization in particular, the information contained in the relative clauses is often removed, being viewed as non-cent...
Selection of chromite processing plant site using fuzzy analytic hierarchy process (FAHP)
Given the existence of chromite deposits in the Sistan and Baluchestan province of Iran, and the various applications of chromite in different industries, a chromite processing plant is expected to be required before long. The geographical location of a processing plant can have a strong influence on the success of an industrial venture. The processing plant site ...